PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Cagra.1508s0062.1.p
Organism
Taxonomic ID
Taxonomic Lineage cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family Trihelix
Protein Properties Length: 405 aa    MW: 44393.7 Da    pI: 9.2985
Description Trihelix family protein
Gene Model
Gene Model ID          Type      Source    Coding Sequence
Cagra.1508s0062.1.p    genome    JGI       View CDS
Signature Domain
No.    Domain      Score    E-value    Start    End    HMM Start    HMM End
1      trihelix    56.5     7.2e-18    19       108    2            73
             trihelix   2 WtkqevlaLiearremeerlrrgk..................lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrts 73 
                          Wt +e++ Liea++ ++er  r++                   ++ +W+++ ++++++g+ rs++qC++kw+nl ++ykk++e e++r +
  Cagra.1508s0062.1.p  19 WTLNETMILIEAKKMDDERRMRRSiglpppeqqqdsrsssnkPAELRWKWIEDYCWRKGCMRSQNQCNDKWDNLMRDYKKVREYERRRVE 108
                          **************55555444433444444444555554449999****************************************9843 PP

Protein Features
3D Structure
Database           Entry ID    E-value    Start    End    InterPro ID    Description
PROSITE profile    PS50090     6.771      11       93     IPR017877      Myb-like domain
Pfam               PF13837     2.8E-13    18       108    No hit         No description
Sequence
Protein Sequence    Length: 405 aa
MADQSGGLVM MREYRKGNWT LNETMILIEA KKMDDERRMR RSIGLPPPEQ QQDSRSSSNK  60
PAELRWKWIE DYCWRKGCMR SQNQCNDKWD NLMRDYKKVR EYERRRVESS FSAAAGESSS  120
SSAAAGGETA PSYWKMEKSE RKERNLPSNM LPQTYQALFE VVESKTSLPS STAVTAAAAA  180
AAAAIGSGNG SGGGQLQKVL QQGLGFVVPK VHQIIQPPPV VVSLPPPPSQ PQPPPQPLPP  240
RPLLLPPPPP PSFHAQQILP TGNSSSDSDT SEYSDTSPAK RRRTMPTTTA GPSGGGSVEA  300
EEEEARRSKR DEETTVAVAL SRSVTMIANA IRESEERQDR RHKEVMSVQE RRLKIEESNV  360
EMNREGMNGL VEAINKLASS IFALASSSSS SSGRHSNQHQ GGPP*
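
The Start and End values in the Signature Domain table (19 and 108) are 1-based, inclusive positions in the protein sequence above, as the alignment rows confirm. A minimal sketch in Python, assuming two hypothetical helper names (parse_sequence_block and slice_domain) that are not part of PlantTFDB, shows how the numbered sequence block might be flattened and the trihelix region cut out. Only the first two rows of the sequence are used here to keep the example short.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a space-grouped, position-numbered protein block into one string.

    Hypothetical helper for illustration; not a PlantTFDB API.
    """
    residues = []
    for line in block.splitlines():
        # Keep letters only: drops spaces, trailing position counters, and '*'.
        residues.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(residues).upper()


def slice_domain(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("domain coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First two rows of the protein sequence listed above.
block = """\
MADQSGGLVM MREYRKGNWT LNETMILIEA KKMDDERRMR RSIGLPPPEQ QQDSRSSSNK  60
PAELRWKWIE DYCWRKGCMR SQNQCNDKWD NLMRDYKKVR EYERRRVESS FSAAAGESSS  120
"""

seq = parse_sequence_block(block)
domain = slice_domain(seq, 19, 108)  # coordinates from the Signature Domain table
print(len(seq), domain[:14], domain[-8:])
# prints: 120 WTLNETMILIEAKK EYERRRVE
```

The extracted region starts with WTLNETMILIEAKK and ends with EYERRRVE, matching the query row of the alignment shown under Signature Domain.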
Regulation -- PlantRegMap
Source         Upstream Regulator    Target Gene
PlantRegMap    Retrieve              -
Annotation -- Protein
Source    Hit ID            E-value    Description
Refseq    XP_006306413.1    0.0        hypothetical protein CARUB_v10012336mg
TrEMBL    R0GU31            0.0        R0GU31_9BRAS; Uncharacterized protein
STRING    AT1G31310.1       1e-180     (Arabidopsis thaliana)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
Malvids    OGEM3493                26             62
Best hit in Arabidopsis thaliana
Hit ID         E-value    Description
AT1G31310.1    1e-110     Trihelix family protein